
PACKET SWITCH AND PACKET SWITCHING METHOD USING 
PRIORITY CONTROL BASED ON CONGESTION STATUS 
WITHIN PACKET SWITCH 



BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION 

The present Invention relates to a packet switch and a 
packet switching method for switching packets between a 
plurality of ports. 



DESCRIPTION OF THE BACKGROUND ART 

In recent years, there are considerable progresses in 

15 the field of data communications. Traditionally, the most 
representative communication network has been a telephone 
network, but the importance of a data communication network 
such as Internet is increasing rapidly due to the spread of 
PCs for business use and home use. Recently a technique for 

20 realizing telephone service on the data communication 

network has also been developed by taking advantages of low 
cost and high efficiency of the data communication network, 
and there is even a discussion of amalgamation of the 
telephone network into the data communication network. 

25 Now, the data communication networks are mostly packet 

switching networks (which are to be construed as including 
ATM switching networks) that carry out communications of 
data to be transmitted to which information indicating a 
destination is attached, that is, packets (which are to be 

30 construed as including cells in ATM. As the traffic amount 
of the data communications increases, there is an 
Increasing demand for a larger scale packet switch for 
switching these packets. 

There are many propositions for a configuration of a 

35 packet switch, which can be largely classified into two 
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types. One Is a multiple stage switch type in which the 
number of ports can be made large, which is a rather old 
type, and the other is a single stage switch type in which 
the number of ports is small, which is a relatively new 
5 type . 

The single stage switch type packet switches are often 
equipped with a congestion control mechanism In^'order to 
prevent a congested output port from affecting flows of 
packets destined to the other non-congested output ports. 

10 In contrast, the multiple stage switch type packet switches 
presuppose that the traffic characteristics ( rate ) are 
determined in advance, so that a severe congestion has not 
been taken into consideration in most of them. It appears 
that this is also largely due to the fact that it has been 

15 difficult to implement a mechanism for dealing with the 
congestion in the multiple stage switch type which 
presupposes the extension to the large number of ports, 
because the congestion occurs as a result of interplay 
between a plurality of input ports and a plurality of 

20 output ports. 

In the Internet which is the most representative data 
communication network, a rate by which each user transfers 
packets is not determined in advance, so that the 
congestion can possibly occur anyivhere at any time. For 

25 this reason, in designing an architecture of a packet 
switch, techniques for dealing with the congestion are 
equally important as techniques for realizing the large 
scale implementation. The techniques for dealing with the 
congestion include a technique for notifying a user so as 

30 to have the packet transfer rate lowered by the user when 
the congestion occurs within the packet switch, and a 
technique for preventing the occurred congestion from 
affecting flows of packets that are unrelated to that 
congestion. 

35 In the following, such a conventional multiple stage 
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switch type packet switch will be described briefly. 

In the case of the multiple stage switch type packet 
switch, a large scale packet switch can be obtained by 
combining smaller switching elements in multiple stages 
5 (see Joseph Y. Hui : "Switching and Traffic Theory for 

Integrated Broadband Networks", ISBN 0-7923-9061-X, Kluwer 
Academic Publishers, 1990, sections 5.5 and 6.3r. 

Fig. 1 shows an exemplary configuration of a multiple 
stage switch type packet switch of 27 inputs and 27 outputs 

10 obtained by combining switching elements 145 each having 3 
inputs and 3 outputs. 

A packet entered from an input port 121 is transferred 
from an input side transfer unit 102 via a switching unit 
104 to a desired output side transfer unit 106, and 

15 outputted from there to an output port 161. This multiple 
stage switch type packet switch can be extended to a lager 
packet switch by Increasing the number of switching 
elements and the number of stages. Also, when each 
switching element is replaced by a larger switching element 

20 of 8 inputs and 8 outputs, for instance, a large scale 
packet switch can be formed using lesser number of 
switching elements. 

A well known configuration of the multiple stage 
switch type packet switch is that in which a random network 

25 and a routing network are connected in cascade connection. 
A random network 141 and a routing network 143 of Fig. 1 
are coupling networks that are line symmetric with respect 
to each other. In each network, any input link can reach to 
arbitrary output link, through a unique route. By 

30 connecting the random network and the routing network in 
cascade connection, there are as many routes from a given 
input side transfer unit 102 to a given output side 
transfer unit 106 as the number of switching elements on a 
border between the random network and the routing network 

35 (nine in Fig. 1) . 
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A packet outputted from the input side transfer unit 
102 is transferred to a randomly selected link so as not to 
collide with other packets within the random network 141 
and reaches to an input of the routing network 143. This 
5 random transfer is done for the purpose of distributing 

packets probabilistically to inputs (nine in Fig. 1) of the 
routing network 143. The packet is then transferred 
according to its destination within the routing network 143 
and reaches to the output side transfer unit 106. 

10 Inside the routing network 143, there is a possibility 

for a packet collision to occur as a result of having a 
plurality of packets simultaneously transferred towards one 
and the same output link of some switching element. If they 
are packets destined to the same output side transfer unit 

15 106, they are bound to collide, and even if they are 

packets destined to different output side transfer units 
106 they may still collide. In a packet switch in which 
each switching element does not have a queuing buffer , when 
the collision occurs, only one of these packets Is 

20 transferred while the other one is iimnediately discarded. 
When a packet is discarded as a result of such a collision, 
the corresponding input side transfer unit 102 re-transmits 
the discarded packet. 

In this way, even when a packet is discarded inside 

25 the switching unit 104, the input side transfer unit 102 
keeps re-transmitting that packet until it is successfully 
transferred to the output side transfer unit 106. 
Consequently, the packet information will never be lost 
inside the switching unit 104. When viewed from outside of 

30 the packet switch, the packet Is discarded only inside the 
input side transfer unit 102 and the output side transfer 
unit 106. 

In the packet transfer control in such a multiple 
stage switch type packet switch, three types of congestion 
35 may occur, each of which will now be described. Here, for 
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units so that a method for appropriately dealing with this 
congestion has been an unresolved problem. This is because, 
when there are many output side transfer units, it has been 
difficult to collect information as to which output side 
5 transfer unit is crowded and which output side transfer 
unit is not, all at once. 

Next, with reference to Fig. 4, the third type of 
congestion will be described. 

This congestion occurs when the rate of the output 

10 port of the output side transfer unit ([C] in an example of 
Fig. 4) is slower than the arrival rate of packets destined 
to that port. This congestion will be referred to as 
"inside output side transfer unit congestion". In an 
example shown in Fig. 4, when the output side transfer unit 

15 [C] is congested, it is actually meaningless for the input 
side transfer unit [A] to transfer packets destined to the 
output side transfer unit [C] unnecessarily frequently, and 
it suffices to transfer at about the rate of the output 
port of the output side transfer unit [C] . 

20 However, the conventional packet switch could have 

transferred excessively many packets to the same output 
port, and this in turn induces the first type congestion 
(inside routing network collision) and the second type 
congestion (output side transfer unit input congestion) 

25 described above. This completes the description of the 
three types of congestion. 

As described, the conventional multiple stage switch 
type packet switch has an advantage in that it can be 
formed in a large scale by combining switching elements, 

30 but there has been no satisfactory method for dealing with 
the severe congestion. In other words, there has been a 
possibility for having a large number of packets destined 
to the congested port transferred within the switching 
unit. These are wasteful transfers for which the 

35 possibility of being discarded within the switching unit is 
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high. The transfers in excess of the transfer rate of the 
output port is actually meaningless, and It could cause a 
situation in which a large number of packets that are 
transferred in excess are discarded within the switching 
5 unit and re-transmitted repeatedly. In such a situation, 
even packets destined to non-congested ports may collide 
with packets destined to the congested port witRin the 
switching unit such that the discarding and the re- 
transmission are repeated and the flow is degraded. This is 

10 a drawback that can eventually lower the packet transfer 
efficiency of the entire packet switch considerably. 

Thus there has been a demand for the congestion 
control in the multiple stage switch type packet switch in 
which the number of ports can be made large, for the 

15 purpose of application to the Internet or the like in which 
the traffic is increasing rapidly. 

SUMMARY OF THE INVENTION 

20 

It is therefore an object of the present invention to 
provide a packet switch and a packet switching method 
capable of taking the full advantage of the transfer 
capability of the packet switch by avoiding the influence 

25 due to the congestion. 

It is another object of the present invention to 
provide a packet switch and a packet switching method in 
which the flow of packets destined to non-congested ports 
will not be affected by the influence of packets destined 

30 to the congested port. 

It is another object of the present invention to 
provide a packet switch and a packet switching method in 
which once discarded packets can be made less likely to be 
discarded at a time of re-transmission. 

35 It is another object of the present invention to 
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provide a packet switch and a packet switching method in 
which a plurality of packets subdividing a single data can 
be transferred collectively. 

According to one aspect of the present invention there 
5 is provided a packet switch, comprising: a plurality of 

input side transfer units from which packets are entered; a 
plurality of output side transfer units from which packets 
are outputted; a switching unit through which each packet 
entered from each input side transfer unit is switched to a 

10 desired output side transfer unit; a congestion status 

monitoring unit configured to monitor a congestion status 
of each transfer target within the packet switch; a 
priority level attaching unit configured to attach a 
priority level to each packet, according to the congestion 

15 status of a transfer target of each packet monitored by the 
congestion status monitoring unit; and a packet selection 
unit configured to select one packet that is to be 
transferred at a higher priority among colliding packets 
when a packet collision occurs within the switching unit, 

20 according to the priority level attached to each colliding 
packet . 

According to another aspect of the present invention 
there is provided a packet switch. Comprising: a plurality 
of input side transfer units from which packets are 

25 entered; a plurality of output side transfer units from 

which packets are outputted; a switching unit through which 
each packet entered from each input side transfer unit is 
switched to a desired output side transfer unit, the 
switching unit having a configuration in which no packet 

30 collision occurs; a congestion status monitoring unit 
configured to monitor a congestion status of each 
prescribed transfer target; and a connection pattern 
calculation engine configured to attach a priority level to 
each packet, according to the congest ion status of a 

35 transfer target of each packet monitored by the 
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a plurality of computers to function as a packet switch in 
which a packet is transferred from an input side transfer 
unit via a switching unit to a desired output side transfer 
unit, the switching unit having a function of selecting one 
5 packet that is to be transferred at a higher priority among 
colliding packets when a packet collision occurs within the 
switching unit, according to the priority level^attached to 
each colliding packet, the computer readable program codes 
include: a first computer readable program code for causing 

10 said one or a plurality of computers to monitor a 

congestion status of each transfer target within the packet 
switch; and a second computer readable program code for 
causing said one or a plurality of computers to attach a 
priority level to each packet, according to the congestion 

15 status of a transfer target of each packet monitored by the 
first computer readable program code. 

Other features and advantages of the present invention 
will become apparent from the following description taken 
in conjunction with the accompanying drawings. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig- 1 is a block diagram showing an exemplary 
25 configuration of a conventional multiple stage switch type 
packet switch. 

Fig. 2 Is a diagram for explaining one example of an 
inside routing network collision that can occur in a 
conventional packet switch. 
30 Fig. 3 is a diagram for explaining one example of an 

output side transfer unit input congestion that can occur 
in a conventional packet switch. 

Fig. 4 is a diagram for explaining one example of an 
Inside output side transfer unit congestion that can occur 
35 in a conventional packet switch. 
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Fig. 5 is a schematic block diagram showing one 
exemplary configuration of a packet switch according to the 
^ first embodiment of the present invention. 

Fig, 6 is a schematic block diagram showing another 
5 exemplary configuration of a packet switch according to the 
first embodiment of the present invention. 

Fig. 7 is a schematic block diagram showing an 
exemplary configuration of a scheduling unit in a packet 
switch according to the second embodiment of the present 
10 invention. 

Fig. 8 is a schematic block diagram showing an 
exemplary configuration of a packet switch according to the 
third embodiment of the present invention. 

Fig. 9 is a schematic block diagram showing an 
15 exemplary configuration of a packet switch according to the 
fourth embodiment of the present invention. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

20 

Referring now to Fig. 5 and Fig. 6, the first 
embodiment of a packet switch and a packet switching method 
according to the present invention will be described in 
detail . 

25 Fig. 5 shows an exemplary configuration of a multiple 

stage switch type packet switch in the first embodiment, 
along with the outline of the operation of this packet 
switch. 

The basic overall configuration of this packet switch 
30 is similar to that of Fig. 1 in that a packet entered from 
an input port is transferred from an input side transfer 
unit 2 via a switching unit 4 to a desired output side 
transfer unit 6, and outputted from there to an output 
port. The number of input ports and the number of output 
35 ports in this packet switch are arbitrary. Also, a 
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plurality of input ports or output ports may be set in 
correspondence to one input side transfer unit or output 
side transfer unit. The internal network configuration 
inside the switching unit 4 of this packet switch is also 
5 arbitrary, and the number of inputs and the number of 
outputs of each switching element that constitutes that 
network are also arbitrary. Of course, this pacEet switch 
may have the same overall configuration as that of Fig. 1. 
Note that Fig. 5 shows an exemplary case of having four 

10 input side transfer units 2 and four output side transfer 
units 6. Also, Fig. 5 omits a detail of the internal 
configuration of the switching unit 4. 

Here, a "packet" in this embodiment may be data to be 
switched by this packet switch itself, or information to be 

15 transferred inside the switching unit prior to the data 
transfer (the former will be referred to as data packet 
while the latter will be referred to as "request"). 

When a word "packet" Indicates "request", the input 
side transfer unit 2 transmits a request with a destination 

20 information written therein to the switching unit 4 prior 
to the data transfer. In this request, a priority level is 
also written. When the request successfully reaches to the 
destination output side transfer unit 6 or when it is 
judged to have reached there, the data packet corresponding 

25 to this request is transferred from the input side transfer 
unit 2 to the output side transfer unit 6. The request 
reserves a route such that, when the data packet is 
transferred along this route, the data packet can be 
transferred to the output side transfer unit 6 safely 

30 without colliding with another data packet in a middle. By 
transferring a request corresponding to a next data packet 
while transferring a current data packet, it is possible to 
switch data from the input side transfer unit 2 to the 
output side transfer unit 6 efficiently in a pipeline-like 

35 manner. 
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In this embodiment, data to be exchanged itself and a 
request to be transferred prior to the data packet transfer 
will be generically referred to by a word "packet" without 
distinguishing them, because the present invention is 
5 effectively applicable to either case. 

Also, the present invention is equally applicable to a 
packet switch that uses a request (or reserves a route) and 
a packet switch that does not use a request (or does not 
reserve a route) . 

10 The above remarks also apply to the subsequent 

embodiments to be described below as well. 

Now, in the packet switch of this embodiment, at a 
time of transferring the packet from the input side 
transfer unit 2 via the switching unit 4 to the desired 

15 output side transfer unit 6, a priority level is attached 
to each packet, and a priority control among packets to be 
transferred is carried out according to this priority level 
at the switching unit 4. Note that this priority level is 
used only locally within the packet switch. 

20 The priority level is set by accounting for at least 

the congestion status with respect to a transfer target of 
a packet that Is to be reached within the packet switch, 
such as an output side transfer unit, an output port, a 
class or a flow to which the packet belongs that Is defined 

25 by subdividing the output port further, for example. Here, 
the flow is defined by a destination address of the packet 
or its appropriate combination with packet (header) 
information such as a port number, a protocol number, etc., 
for example, which is given by a virtual connection in the 

30 case of ATM communications. At a time of setting the 

priority level, it is also possible to account for other 
information such as a prescribed attribute associated with 
the packet (a service class to which the packet belongs, 
for example) , or a special condition associated with the 

35 packet (such as an indication that it is a re-transmission 
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packet, for example), for example. 

There is no particular limitation regarding: the steps 
of the priority level, because the present invention is 
effectively applicable regardless of the number of steps of 
5 the priority level. 

The present invention is also effectively applicable 
regardless of a location at which the priority level is 
attached to the packet, but this embodiment is directed to 
an exemplary case where the priority level is attached to 

10 the packet at each input side transfer unit 2. 

The present invention is also effectively applicable 
regardless of whether the congestion status is monitored 
inside the switching unit 4 or inside the output side 
transfer unit 6, but the latter is more suitable so that 

15 this embodiment is directed to an exemplary case where the 
congestion status is monitored at each output side transfer 
unit 6 and this information is notified to the input side 
transfer unit 2, 

In order to realize the control based on the priority 

20 level set by accounting for the congestion status, the 

packet switch of Fig. 5 has an exemplary configuration in 
which each input side transfer unit 2 has a priority level 
attaching unit 21 and a congestion level table 12, each 
output side transfer unit 4 has a congestion status 

25 monitoring unit 61, and each switching element 45 has a 
packet selection unit 451. 

Note that it is also possible to adopt a configuration 
in which a single congestion level table is shared by all 
the input side transfer units 2. In such a case, a 

30 congestion level table management unit (not shown) for 
receiving a notification of an information regarding the 
congestion status from the output side transfer units 6 and 
reflecting this information into the common congestion 
level table 12 can be provided. It is al6o possible to 

35 adopt a configuration in which the packet selection unit 
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451 is not provided In some switching elements 45. 

In short, the priority level attaching unit 21 of the 
input side transfer unit 2 has a congestion level table 
management function that receives a notification of an 
5 information regarding the congestion status as monitored by 
the congestion status monitoring unit 61 of the output side 
transfer unit 6 and reflects this Information into the 
congestion level table 12, and a priority level attaching 
function that determines the priority level to be attached 

10 to the packet by accounting for the content of the 
congestion level table 12 when the packet is to be 
transmitted from this input side transfer unit 2. 

The congestion level may be an information regarding 
the notified congestion status itself, or a value obtained 

15 by converting the information regarding the notified 

congestion status. The congestion level may be set in units 
of the output side transfer units, in units of output 
ports, or in units of classes or flows that are defined by 
subdividing the output port further, 

20 As a basic manner of setting the priority level, it is 

possible to set the lower priority level for packets with 
the congestion level is higher and the higher priority 
level for packets with the lower congestion level, by 
referring to the congestion level corresponding to each 

25 packet from the congestion level table 12. 

Note however that it is possible to consider various 
variations regarding a relationship between the congestion 
level and the priority level. For instance, the congestion 
level and the priority level may have a linear relationship 

30 or a nonlinear relationship. Also, the number of steps of 
the congestion level and the number of the steps of the 
priority level may be set to be identical or different. 
Also, as already mentioned above, the priority level may be 
obtained from the congestion level and the other 

35 information. 
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Also, a procedure or a function for obtaining the 
priority level from the congestion level may be defined in 
the same units in which the congestion level is set, or in 
units that are finer than those of the congestion level. 
5 For instance, the priority level may be set in units of 
flows while the congestion level is set in units of output 
ports. 

In short, the packet selection unit 451 of the 
switching element 45 selects a packet to be transferred at 

10 a higher priority by accounting for the priority levels 
attached to packets when they collide. 

As a basic manner of packet selection, it Is possible 
to select a packet having the highest priority level. 
However, it is also possible to select a packet by also 

15 accounting for the other information (such as some kind of 
information that Is described In the packet header) In 
addition to the priority level. 

It is also possible to provide some special selection 
criteria for the case in which a single packet cannot be 

20 selected by the ordinary judgement, as in the case where a 
plurality of packets having the highest priority level 
exist. Alternatively, In the case where a single packet 
cannot be selected by the ordinary Judgement, one of a 
plurality of packets having the highest priority level may 

25 be selected randomly. 

In short, the congestion status monitoring unit 61 of 
the output side transfer unit 4 monitors the congestion 
status of the corresponding output side transfer unit 4. 
The information regarding the monitored congestion status 

30 is notified from the corresponding output side transfer 
unit 4 to the input side transfer unit 2. 

The monitoring of the congestion status may be carried 
out in units of the output side transfer units, in units of 
output ports, or in units of classes or flows that are 

35 defined by subdividing the output port further. 
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As the Information regarding the congestion status 
that is to be notified, it is possible to directly use a 
prescribed measurement value regarding the congestion 
status (such as the number of packets or bytes that are 
5 transferred per unit time, the queue length (the number of 
packets or bytes) of a packet queue, or the number of 
packets or bytes that are discarded from a packet queue, 
for example). It is also possible to use a result of 
classification of the prescribed measurement values into 
10 congested and non-congested according to a single reference 
value, or level values obtained by classifying the 
prescribed measurement values according to plural reference 
values. 

Note that the units of the congestion status 
15 monitoring and the units of the congestion level table 

entry may be made identical (both can be in units of output 
ports, for example), or the units of the congestion level 
table entry may be made coarser than the units of the 
congestion status monitoring (the former can be in units of 
20 output ports while the latter is in units of flows, for 
example) , 

In the following, this embodiment will be described in 
further detail. 

25 First, the case in which a packet collision occurred 

will be described. 

When a packet collision (such as the Inside routing 
network collision) occurred Inside the switching unit 4, 
the switching element 45 at the corresponding location 

30 selects a packet to be transferred at a higher priority by 
accounting for the priority levels attached to the collided 
packets, using the packet selection unit 451, and transfers 
only the selected packet. 

The other packet that is not selected will be 

35 immediately discarded and then re-transmitted by the 
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corresponding input side transfer unit 2 later on, for 
example (in this case, there is a need for the input side 
transfer unit 2 to maintain the transmitted packet for a 
certain period of time in view of the possibility of the 
5 packet re-transmission) . As a method for specifying a 
timing" for the re-transmission, it is possible to use a 
method in which the switching element that discarded the 
packet sends a re-transmission message to the input side 
transfer unit 2, or a method in which the output side 

10 transfer unit 4 to which the packet has reached is to send 
a packet arrival message to the transfer source input side 
transfer unit 2, and the input side transfer unit 2 judges 
that the packet is discarded when the packet arrival 
message from the transfer target output side transfer 

15 unit 4 is not received even after a certain period of time 
has elapsed. 

Alternatively, it is also possible to use a 
configuration in which all or a part of the switching 
elements are equipped with a packet buffer, and the packet 

20 that is not selected is temporarily stored in a queue of 
the packet buffer inside that switching element instead of 
being iimnediately discarded. 

Next, the monitoring and the notification of the 
congestion status will be described. 

25 This embodiment incorporates a feature that the 

congestion status of the route on which the packet Is 
transferred is monitored, and the information regarding 
that congestion status is notified to a portion that 
attaches the priority level to the packet (the input side 

30 transfer unit 2 in this embodiment). As mentioned above, in 
this embodiment, a location for monitoring the congestion 
status is set to be the output side transfer unit 6, which 
is more suitable for this purpose. This point will now be 
described, 

35 If the rate at which packets are outputted from the 
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output side transfer unit 6 to the output port Is lower 
than the rate at which the packets Inflow from the 
switching unit 4 into the output side transfer unit 2. the 
congestion status inside the switching unit 4 will be 
5 reflected in the congestion (the increase of the number of 
packets or bytes that are stored in the output buffer, for 
example) inside the output side transfer unit 6? In other 
words, whenever the output side transfer unit input 
congestion occurs, the inside of the output side transfer 

10 unit 6 is also congested. 

Also, even if the congestion does not occur Inside the 
switching unit 4, there are cases where the congestion (the 
Inside output side transfer unit congestion described 
above) occurs only Inside the output side transfer unit 6. 

15 This congestion occurs more frequently in the configuration 
In which a single output side transfer unit 6 has a 
plurality of output ports. 

From the above, it can be seen that both the output 
side transfer unit input congestion and the Inside output 

20 side transfer unit congestion can be detected by monitoring 
only the output side transfer unit 6. 

Note that the congestion status is monitored in units 
of output side transfer units, output ports, classes, flows 
(virtual connections, for example), or their combination. 

25 Fig. 5 shows an exemplary case of monitoring the congestion 
status for each flow and setting an entry of the congestion 
level table 12 for each flow. 

Now, the congestion status so monitored is to be 
notified to the input side transfer unit 2 in this 

30 embodiment, and there are several possible methods for 

notifying the congestion status to the input side transfer 
unit 2. 

For example, it is possible to use a method In which 
the congestion status is sent along with an ACK/NACK signal 
35 to be transferred from the output side transfer unit 6 to 
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the input side transfer unit 2. An ACK (Acknowledgement) 
sig-nal for notifying that a packet has safely arrived at 
the output side transfer unit 6 and a NACK (Negative 
Acknowledgement) signal for rejecting a packet for some 
5 reason even when a packet arrived at the output side 
transfer unit 6 are signals to be transferred from the 
output side transfer unit 6 to the input side tfansfer unit 
2 even in the conventional packet switch. According to the 
generally well known method, these signals are returned 

10 from the output side transfer unit 6 to the Input side 
transfer unit 2 that transmitted that packet, along the 
transfer route of the packet in the reverse direction. One 
method for notifying the congestion status is to send the 
information regarding the congestion status along with this 

15 ACK/NACK signal. 

As another method, it is possible for each output side 
transfer unit 6 to generate and return a packet for 
notifying the information regarding the congestion status 
whenever a prescribed number (which may be set to 1) of 

20 packets arrive at the unit of monitoring (port, for 
example), or whenever a prescribed period of time has 
elapsed, toward the input side transfer unit 2 that has 
transmitted a packet at that moment or the input side 
transfer unit 2 that has transmitted a packet between that 

25 moment and a time of the previous notification. 

By transferring the congestion status in such a 
manner, it becomes possible to notify the congestion status 
of the output side transfer unit 6 to the input side 
transfer unit 2 easily even in the packet switch with the 

30 large number of ports. 

Note that, in any of the cases described above, when 
the unit of monitoring is set to be a unit (port, for 
example) that is finer than the output side transfer unit, 
the information regarding the congestion status to be 

35 notified may be an information for all the transfer targets 
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which share the same output side transfer unit 6 instead of 
an information for the same transfer target (port, for 
example) as the arrived packet, or an entire information 
for all the output side transfer units 6. 
5 Also, the information regarding all the congestion 

statuses for all the output side transfer units 6 may be 
notified to all the input side transfer units 2T 

Also, as mentioned above, instead of monitoring and 
notifying the congestion status at the output side transfer 

10 unit 6, the congestion status may be monitored at and 

notified from the switching element 45 that is located at 
one or plural stages previous to the output side transfer 
unit 6, or both the output side transfer unit 6 and the 
switching element 45. 

15 Next, the several examples of the packet transfer 

control based on the priority level within the packet 
switch will be described. 

First, the most representative control, that is, the 
priority control to be carried out according to the 

20 congestion status will be described. 

In general, the priority level of a packet is usually 
determined according to a class to which that packet or a 
flow of that packet belongs such as the real time 
Information or the best effort information, for example, 

25 but in this embodiment, this priority level is set 

according to the congestion status (or the priority level 
that is originally assigned as described above is changed) . 

For example, the priority level of the packet destined 
to the congested transfer target (which is assumed to be 

30 the congested port here) is lowered. A packet with a higher 
priority level will be handled at a higher priority even 
when the collision occurs inside the switching unit 4, so 
that packets with lower priority levels destined to the 
congested port are virtually non-existent for the packet 

35 with a higher priority level. 
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Consequently, in this embodiment, there Is an effect 
that the flow of packets destined to the non-congested port 
Is hardly disturbed by the flow of packets destined to the 
congested port. Even for the packet destined to the 
5 congested port, only its priority level is changed and the 
transfer rate is not suppressed, so that it is possible to 
continue to attempt the transfer to the output side 
transfer unit 2 by utilizing an idle time between the 
transfers of the packets with the higher priority levels. 

10 For this reason, it is possible to continue to transfer 

packets at the full transfer capacity of the switching unit 
4, even with respect to the congested port. 

As an alternative to this embodiment, it is possible 
to consider a complicated control in which the transfer 

15 rate of packets destined to the congested port Is to be 
strictly suppressed according to the congestion level 
Instead of using the priority level, but according to this 
embodiment, such a transfer rate suppression control is 
unnecessary so that there is an advantage in that a 

20 structure of the input side transfer unit 2 can be made 
simpler. 

Note that the method for transferring a packet by 
attaching the priority level is already conventionally 
known, but this embodiment essentially differs from such a 
25 conventional method in that the priority level is set or 
changed dynamically according to the congestion level. 

Next, the priority control with respect to the re- 
transmission packets will be described. 

In the packet switch using a scheme in which the input 
30 side transfer unit 2 re-transmits a packet that was 

discarded as a result of the collision inside the switching 
unit 4, if the same packet is discarded consecutively, 
there arises a possibility for affecting the entire flow of 
packets that are waiting for transfers after that packet. 
35 As a method for resolving this problem, it is possible 
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to raise the priority level of the re-transmission packet 
higher than the priority level of the original packet when 
the packet is discarded inside the switching unit 4. With 
this control, there is an advantage that the re- 
5 transmission packet becomes less likely to be discarded 

again- If the re-transmission packet is discarded again, it 
is possible to raise the priority level of the re- 
transmission packet for that discarded re-transmission 
packet even higher. In this way, the unfortunate packet 

10 that has been re-transmitted repeatedly can be transferred 
at a higher priority. 

Next, the priority control with respect to a probe 
packet will be described. 

As in the case of the re-transmission packet described 

15 above, there are other situations where the temporal change 
of the priority level is beneficial. 

For example, in the configuration in which the 
information regarding the congestion status is to be 
notified from the output side transfer unit 6 to the input 

20 side transfer unit 2 that has transmitted the packet at a 
timing of the arrival of the packet to the output side 
transfer unit 6, when the input side transfer unit 2 
transfers a packet to a new destination for the first time 
(or after a sufficiently long period of time during which 

25 the congestion status has likely been changed) , the 

congestion status of its destination cannot be ascertained 
(or the actual congestion status of its destination cannot 
be ascertained) • A packet to be transferred when the 
congestion status of the destination is unknown (or 

30 virtually unknown) as such is mainly significant as a probe 
signal to ascertain the congestion status of the 
destination. Such a packet will be referred to as a probe 
packet here. 

By raising the priority level of this packet for 
35 probing (probe packet) higher, the probe packet becomes 
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less likely to be discarded at the switching unit 4 so that 
the congestion status of the destination can be ascertained 
promptly, and there is an advantage that the priority 
levels of packets to be transferred subsequently can be set 
5 to appropriate values quickly. Although it is sufficient to 
set a high priority level only to the first one packet, it 
is also possible to set a high priority level to the first 
plural packets depending on the convenience of the 
implementation or the like. 

10 Next, the priority control with respect to the divided 

datagram will be described. 

As another example in which it is beneficial to change 
the priority level temporarily, when a datagram that is 
divided into plural portions is to be transferred to the 

15 switching unit 4, it is possible to set the priority level 
of a packet corresponding to the first portion of the 
datagram lower than the priority level of packets 
corresponding to the remaining portions of the datagram. 
Equlvalently , it is also possible to set the priority 

20 levels of packets corresponding to the remaining portions 
of the datagram higher than the priority level of a packet 
corresponding to the first portion of the datagram. For 
instance, the priority level of the top packet can be set 
according to the method of this embodiment, and the 

25 priority levels of the second and subsequent packets can be 
set much higher. 

Also, In the first transfer of the top packet, if the 
top packet failed to reach the output side transfer unit 6, 
the transfer can be repeated by setting the priority level 

30 higher. 

In this way, once the top packet corresponding to the 
first portion of the datagram reaches to the output side 
transfer unit 6, the subsequent portions will also reach 
the output side transfer unit 6 consecutively at a higher 
35 priority, so that it becomes possible to transfer the 
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datagram comprising plural packets to the output side 
transfer unit 6 collectively. In the case where the output 
side transfer unit 6 reconstructs the original datagram 
from the divided plural portions and then output the 
5 reconstructed datagram to the external, there is an 

advantage that the amount of buffer required in order to 
reconstruct the datagram can be made smaller by'^carrying 
out the transfer as described above at the switching unit 
4. 

10 As described, apart from the method for determining 

the priority level of the packet according to the 
congestion level alone, there is also a method for 
determining the priority level of the packet by applying 
the change according to the congestion status, the change 

15 according to the number of re-transmissions, etc., to the 
priority level of the class to which the packet or the flow 
belongs, for example, or a method for determining the 
priority level of the packet by using the congestion level, 
the class, and/or other factors as parameters. 

20 It is also possible to guarantee the packet transfer 

rate within the switching unit 4 by carrying out the 
control to raise the priority level of the packet higher. 
For example, it is possible to provide a service that 
guarantees the minimum value of the transfer rate by 

25 monitoring the flow transfer rate at the input side 
transfer unit 2 and carrying out the control which 
temporarily raises the priority level when the monitored 
transfer rate approaches to a prescribed minimum transfer 
rate. In this service, it is possible to carry out 

30 transfers at a rate above the guaranteed rate when the 

switch is not crowded. In addition, it is also possible to 
attach weights to packets and divide the bandwidth by a 
ratio of weights. 

Note that, when the priority level attaching unit 21 

35 for setting or changing the priority level of the packet is 
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realized by providing a processor inside the Input side 
transfer unit 2 and determining the priority level by 
software, it becomes possible to flexibly handle the change 
in the number of classes to be dealt with or the port 
5 configuration of the packet switch during its operation. 

Next, some variations of the congestion level table 12 
and the priority level attaching unit 21 will b§ described. 

In this embodiment, one of the major features is that 
the priority level of the packet destined to the 
10 corresponding destination is changed according to the 

congestion status notified from the output side transfer 
unit 6. 

As an exemplary realization of this feature, there is 
a method in which the congestion level obtained from the 

15 information regarding the congestion status notified in 

response to the packet transfer is stored in the congestion 
level table 12 of the input side transfer unit 2 that 
transferred that packet, and the priority level of the 
packet destined to the same destination that is to be 

20 transferred from there is set or changed according to the 
congestion level table 12. Besides this method, there are 
also many variations of the method for realizing this 
feature. 

For example, it is possible to store the information 
25 regarding the congestion status at plural locations, such 
as at plural input side transfer units 2 coimnonly. Using 
the congestion level table 12 commonly, each input side 
transfer unit 2 can use the congestion status obtained by 
the other input side transfer unit 2 in addition to the 
30 congestion status obtained by this input side transfer unit 
2 itself, so that more congestion statuses can be 
ascertained and there is an advantage that a probability 
for being capable of dealing with the congestion 
immediately becomes high. 
35 Also, as another method, as shown in Fig. 6, it is 
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possible to (attach the priority level not accounting for 
the congestion status to the packet at the input side 
transfer unit 2 and) change the priority level attached to 
the packet to that accounting for the congestion status at 
5 a packet priority level changing unit 42 provided inside 
the switching unit 4, Instead of attaching the priority 
level accounting for the congestion status to tEe packet at 
the input side transfer unit 2. 

In this case, the congestion status transferred from 

10 the output side transfer unit 6 is notified to the packet 
priority level changing unit 42 inside the switching unit 
4, The packet priority level changing unit 42 obtains the 
congestion level from the notified congestion status and 
stores the obtained congestion level into a congestion 

15 level table 52, and changes the priority level of the 
packet according to the destination of the packet that 
passes through there. It is not absolutely necessary for a 
single packet priority level changing unit 52 to handle all 
the output side transfer units 6 of the packet switch. By 

20 providing a plurality of packet priority level changing 

units 52 in a single switching unit 4 such that each packet 
priority level changing unit 52 handles only the nearby 
output side transfer units 6, the processing can be 
distributed and the implementation can be made easier. 

25 In this case, the priority level according to the 

congestion status of the destination can be attached to the 
packet at a location closer to that destination, so that 
there is an advantage that the priority level can be 
attached to the packet effectively. 

30 

Referring now to Fig, 7, the second embodiment of a 
packet switch and a packet switching method according to 
the present invention will be described in detail. 
In the first embodiment, the priority level is 
35 controlled according to the congestion level, whereas in 
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a class level scheduling setting unit 24, a transfer packet 
selection unit 26, and a congestion level table 32, 

In the example shown in Fig. 7, it is assumed that the 
congestion level and the class are to be used in the 
5 scheduling. For this reason, Fig. 7 shows the configuration 
in which a congestion level field and a class field are 
provided in the congestion level table 32. 

In the following, this second embodiment will be 
described in further detail. 

10 Packets that are waiting for the transfers inside the 

input side transfer unit 2 can be transferred efficiently 
by classifying them Into groups where members of each group 
share the same congestion status, and providing 
corresponding packet queues 22. For example, the packet 

15 queue 22 is provided for each packet flow (or virtual 

connection). As another example, it is also effective to 
provide the packet queue 22 for each class of each output 
port of each output side transfer unit 6 of the packet 
switch. More simply, it is also possible to provide the 

20 packet queue 22 for each class of each output side transfer 
unit 6, Note however that the classification according to 
classes is unnecessary in the case of a packet switch that 
does not have a concept of classes. 

Here it is assumed that the packet queue 22 is 

25 provided and managed for each flow. 

When the congestion status is transferred from the 
output side transfer unit 6, the input side transfer unit 2 
stores/updates the congestion level of each flow in the 
congestion level table 32 accordingly. When each flow has 

30 an attribute of class, it is also convenient to store this 
information in the same table 32. 

The transfer packet selection unit 26 of Fig. 7 refers 
to the congestion level table 32 and the class level 
scheduling setting unit 24 at a time of selecting a packet 

35 to be transferred. The class level scheduling setting unit 
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24 sets a packet selection policy among classes, such as 
that the transfer rate of the real time class (RT of Flg» 
7) is limited to be 80 % of the maximum packet transmission 
rate from the input side transfer unit 2 at most and 
5 packets of the real time class are outputted at a hlg-her 
priority over packets of the best effort class (BE of Fig. 
7), for example. 

In this second embodiment, the transfer packet 
selection unit 26 first carries out (1) a selection of a 

10 class according to the setting content stored in this class 
level scheduling setting unit 24, then (11) a selection of 
a flow destined to the non-congested destination as much as 
possible, from flows of the selected class. 

At this point, ideally speaking, the flow destined to 

15 the least congested destination should be selected, but it 
is not necessarily so easy to select the flows strictly in 
an order of their congestion levels depending on the 
convenience of the implementation. However, it is possible 
to achieve a sufficient effect by selecting the flow 

20 destined to the non-congested destination more often than 
the flow destined to the congested destination. As a 
simplest possible implementation, it is possible to roughly 
classify the flows into three groups according to their 
congestion levels, and select a flow from the least 

25 congested group. In this way, a flow is not necessarily 
selected strictly in an order of their congestion levels 
within the same group, but flows can be selected in roughly 
increasing order of the congestion levels overall. 

When a flow is selected, the transfer packet selection 

30 unit 26 next carries out (ill) a transfer of packets to the 
switching unit 4 starting from a top of the packet queue 22 
of that flow. 

Thereafter, the above operations (1) to (ill) are 
repeatedly executed. 

35 As described, according to this second embodiment, the 
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input side transfer unit 2 transfers packets at a higher 
priority to the non-congested destinations, so that the 
problem of affecting the flow of packets destined to the 
non-congested destinations due to the influence of the flow 
5 that is already affected by the congestion can be easily 
resolved . 

Also, in this second embodiment, when a packet is 
transferred, the latest congestion status of the flow is 
notified in response to the transfer of that packet, so 
10 that there is an advantage that the selection priority 
level of that flow is dynamically changed to the 
appropriate value according to the latest congestion 
status • 

Note that this second . embodiment (the packet selection 
15 in the scheduling) can be practiced in combination with the 
first embodiment (the packet selection in the case of 
collision) • 

In such a case, it is also possible to expect the 
following effects. Namely, if there are only packets 

20 destined to the congested destinations inside the input 
side transfer unit 2, these packets would be outputted 
consecutively toward the congested destinations, but these 
packets have a high probability of being discarded and re- 
transmitted. However, when the second embodiment is 

25 combined with the first embodiment, the priority levels 
according to the congestion levels are attached to the 
packets so that the outputting of these packets does not 
cause any problem. Such a configuration has an advantage 
that it can be realized more simply than a control for 

30 suppressing the transfers of the packets destined to the 
congested destinations. In the control for suppressing the 
transfers, there is a need to carry out the strict control 
in order to prevent the lowering of the utilization 
efficiency of the switching unit 4 due to the excessive 

35 suppression, whereas this configuration only requires a 



-32- 



much simpler control. 

Also, when the second embodiment is combined with the 
first embodiment, the congestion level table <and a portion 
to obtain the congestion level from the notified congestion 
5 status) can be given in a configuration in which a portion 
for attaching the priority level to the packet and a 
portion to be used for the scheduling are independently 
provided, or in a configuration in which these portions are 
integrally provided in a single congestion level table. 

10 Note also that, in the case of Implementing the first 

and/or second embodiments described above, it Is preferable 
to define a valid period for the congestion level. For 
example, at the input side transfer unit 2, if the new 
congestion status is not notified for a prescribed period 

15 of time after the congestion level of a given flow is 

determined, that old congestion level can be invalidated. 

Referring now to Fig. 8, the third embodiment of a 
packet switch and a packet switching method according to 

20 the present invention will be described in detail. 

In the first and second embodiments, the features for 
preventing the flows of packets destined to the non- 
congested output ports or the like from being affected by 
the congestion of the other unrelated portion have been 

25 mainly described. 

In this third embodiment, a mechanism for notifying a 
request to lower the packet transmission rate to a user who 
is transmitting packets that are causing the congestion 
will be described. 

30 This third embodiment is applicable to any of the 

first embodiment, the second embodiment, and a combination 
of the first and second embodiments, but it is also 
possible to practice this third embodiment independently 
(that is, this third embodiment is applicable regardless of 

35 a configuration of the switching unit), and by practicing 
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this third embodiment in combination with the first 
embodiment and/or the second embodiment, it is possible to 
construct a packet switch with an excellent comprehensive 
congestion handling: mechanism. 
5 Fig. 8 shows an exemplary configuration of a multiple 

stage switch type packet switch according to the third 
embodiment, along with the outline of the operation of this 
switch, 

A function for notifying the congestion to the user 

10 has conventionally been proposed or practiced in the single 
stage switch type packet switch. However, it has been 
difficult to apply this function effectively to the 
multiple stage switch type packet switch with a large 
number of ports. In the following, as a method for 

15 notifying the congestion to the user, three major methods 
used in the conventional single stage switch type packet 
switch will be described first. 

The first method for notifying the congestion is a 
method in which the congestion is notified by the packet 

20 discarding. For example, in the protocol called TCP that is 
widely used in the Internet, when the packet discarding is 
detected, a control to make the transfer rate (a window 
size to be accurate) smaller is activated in order to 
relieve the congestion. In the discarding control for the 

25 packet switch that handles packets transferred by TCP, the 
method called RED (Random Early Detection) which increases 
the packet discarding probability as the congestion becomes 
worse is generally considered superior. In the case of 
notifying the congestion by the packet discarding, it is 

30 known that the congestion can be notified to the user 

quicker by discarding the top packet of the packet queue 
rather than discarding the tail packet of the packet queue, 
and in this way a possibility of relieving the congestion 
before it becomes severe can be made higher. 

35 The second method for notifying the congestion is a 
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method usin& the congestion experience notification, in 
which a congestion experience field is provided in a part 
of a packet header and this field is marked in a packet 
that has experienced the congestion so that the user can be 
5 notified of the congestion when this field is marked. In 
the case of ATM coimnunications , this method is called EFCI 
(Explicit Forward Congestion Indication). Also, "in the 
Internet, this method is called ECN (Explicit Congestion 
Notification) which Is currently under the process of 

10 standardization. Similarly as in the case of the packet 
discarding based method, in the case of the congestion 
experience notification, it is known that the congestion 
can be notified to the user quicker by marking the top 
packet of the packet queue rather than marking the tail 

15 packet of the packet queue. 

The third method for notifying the congestion is a 
method using BECN (Backward Explicit Congestion 
Notification). In the case of ATM communications, there is 
a proposition in which a switch generates a special cell (a 

20 packet is called a cell in ATM coimnunications) called a 
BECN cell in order to notify the congestion when the 
congestion occurs and transfers this BECN cell toward a 
transmitting side user. A processing for loading 
information onto a resource management cell of an ABR 

25 service category in ATM is similar in a sense that it also 
transfers Information directly toward a transmitting side 
user. In the Internet, there is also a similar control 
message called Source Quench. In the case of BECN, the 
congestion is directly notified to an upstream side user 

30 rather than a downstream side user, so that there is no 
need to carry out anything special with respect to a top 
packet of the packet queue. However, at a time of 
determining a user to which BECN is to be transmitted, it 
is preferable to make a judgement at a location where the 

35 congestion is monitored, i.e., at an output side of the 
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packet switch, from a viewpoint of realizing the fair 
congestion notification. This is because the congestion 
notification should be made with respect to a user who is 
most responsible for causing the congestion at each 
5 congested location. 

In the above, three major methods for the congestion 
notification have been described. What is commoii to all 
these methods is that it is better to carry out the 
congestion control at the output of the queue as much as 

10 possible. When this factor is taken into consideration in 
the packet switch of each embodiment described above, it 
can be said that it is preferable to carry out the 
congestion control at the output side transfer unit 6 
rather than the input side transfer unit 2. In this regard, 

15 in the conventional multiple stage switch type packet 

switch, it has been impossible for the output side transfer 
unit 6 to accurately comprehend the congestion status 
inside the input side transfer unit 2, so that it has been 
difficult to comprehend the congestion status of the packet 

20 switch as a whole. If it is a packet switch with the lesser 
number of ports such as a single stage switch type packet 
switch, it has been possible to realize connections such 
that each output side transfer unit 6 can monitor the 
congestion statuses of all the input side transfer unit 2, 

25 but if it is a packet switch with the large number of ports 
such as a multiple stage switch type packet switch, it has 
been difficult to realize such connections. 

As a method for resolving this problem, as shown in an 
example of Fig. 8, it is possible to consider a method in 

30 which the congestion status inside the input side transfer 
unit 2 is notified to the output side transfer unit 6 along 
with a packet, and the output side transfer unit 6 carries 
out the congestion control by judging the comprehensive 
congestion status from the congestion status inside the 

35 input side transfer unit 2, in the packet switch in which a 
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In general, in a packet switch, it is necessary to 
calculate a connection pattern for connecting the input 
side transfer unit and the output side transfer unit at 
high speed. For example, in a multiple stage switch as 
5 shown in Fig. 1. the connection pattern for connecting the 
input side transfer unit and the output side transfer unit 
is effectively calculated as autonomous distribdted routing 
of packets is carried out and a route of a packet that 
survived the collision is obtained. On the other hand, in 

10 the cross-bar type switch. Instead of adopting such an 

autonomous distributed approach, a centralized engine for 
calculating the connection pattern exists at one location 
in the switch. In general, the amount of calculations of 
this engine is the order of when the number of ports in 

15 the switch is N, and it is known that the calculation 

becomes more difficult as the number of ports is increased. 
This fourth embodiment is applicable to an algorithm of 
such a connection pattern calculation engine. 

Fig. 9 shows an exemplary configuration of a cross-bar 

20 type packet switch according to the fourth embodiment, 
along with the outline of the operation of this packet 
switch. Fig. 9 shows an exemplary case corresponding to the 
first embodiment. 

The basic overall configuration of the multiple stage 

25 switch type packet switch of this fourth embodiment is 
similar to that of Fig. 1 as in the first embodiment. In 
that a packet entered from an input port is transferred 
from an input side transfer unit 2 via a cross-bar switch 3 
to a desired output side transfer unit 6, and outputted 

30 from there to an output port. Also, similarly as in the 

first embodiment, the number of input ports and the number 
of output ports in this packet switch are arbitrary, a 
plurality of input ports or output ports may be set in 
correspondence to one input side transfer unit or output 

35 side transfer unit, the internal network configuration 
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Inside the switching unit of this packet switch is also 
arbitrary, and the number of inputs and the number of 
outputs of each switching element that constitutes that 
network are also arbitrary. In this case, the switching- 
5 element can be an LSI that integrates one or more cross 
points. 

Note that the priority level attaching unit, the 
congestion status monitoring unit, the congestion level 
table, and the packet selection unit with respective 

10 functions similar to those of the first embodiment are also 
provided, but they are assumed to be incorporated into an 
algorithm of a connection pattern calculation engine 7. 
However, in the case of using a separate hardware as in the 
case of using a counter for the purpose of monitoring the 

15 congestion status at the congestion status monitoring unit, 
for example, the corresponding portion is to be implemented 
similarly as in the embodiments described above. Any 
portion other than a portion that is inevitably 
Incorporated into the algorithm of the connection pattern 

20 calculation engine 7 such as the control and processing to 
be carried out within the switching unit may also be 
Implemented in a manner similar to that of the embodiments 
described above . 

Now, upon receiving a data transfer request of the 

25 input side transfer unit 2, the connection pattern 

calculation engine 7 calculates the connection pattern of 
the cross-bar switch 3, Here, as shown in Fig. 9, the 
information regarding the congestion status is sent from 
the output side transfer unit 6 to the connection pattern 

30 calculation engine 7. 

This connection pattern calculation engine 7 holds 
(that is, carries out a simulation of) a tournament among 
data transfer requests from the input side transfer units 2 
using its internal software, and notifies the connection 

35 pattern to the cross-bar switch 3 such that data 
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corresponding to requests that survived the tournament will 
be exchanged. As a tournament table to be used here, a 
prescribed topology (such as a topology shown in Fig. 1, 
for example) can be used. 
5 In the case of using the topology shown in Fig. 1, the 

tournament is held by using the topology of the routing 
network by rearranging requests into a random order by 
using the topology of the random network. 

In the competition of the tournament, the surviving 

10 request is determined according to the priority level that 
is set or changed by accounting for the congestion level in 
a congestion level table 92. 

When a set of requests for which the transfer towards 
the output side transfer units 6 are to be permitted is 

15 obtained in this manner, the connection pattern is 

calculated and data are exchanged by the cross-bar switch 3 
that is set up by the calculated connection pattern. 

In the case of using the topology shown in Fig. 1, 
when the number of ports in the switch is N, there are N 

20 degrees of freedom along a vertical direction and logN 
degrees of freedom along a horizontal direction so that 
requests for which the transfer towards the output side 
transfer units 6 are to be permitted can be obtained by the 
calculations of the order of N-logN. Thus the calculation 

25 can be carried out relatively fast, and there is also an 
advantage that the fairness can be secured because the 
congestion statuses of the output side transfer units 6 are 
accounted for. 

Note that the configuration in which this fourth 

30 embodiment is applied to a combination of the first 

embodiment and the second embodiment and/or the third 
embodiment can also be realized similarly. 

As described above, according to the present 
35 invention, the priority level according to the congestion 
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status of the transfer target Is attached to a packet, and 
the processing at a time of packet collision is carried out 
by accounting for this priority level, so that it becomes 
possible to carry out the packet transfer control according 
5 to the congestion status of the transfer target of each 
packet. For example, by attaching a relatively high 
priority level to packets destined to non-congested ports, 
it is possible to prevent the flows of packets destined to 
the non-congested ports from being affected by the 

10 influence of packets destined to a congested port. 

Also, according to the present invention, by attaching 
a higher priority level to a once discarded packet, it is 
possible to make the once discarded packet less likely to 
be discarded again at a time of re-transmission, 

15 Also, according to the present invention, by attaching 

a lower priority level to a packet corresponding to a top 
portion of data than packets corresponding to subsequent 
portions of data, it is possible to transfer a plurality of 
packets subdividing the data collectively once the packet 

20 corresponding to the top portion of data reaches to the 
transfer target. 

It is to be noted that, in each embodiment described 
above, the case of applying the present invention to a 

25 multiple stage switch type packet switch has been mainly 
described, but the present invention is also applicable to 
a single stage switch type packet switch. Also, the present 
invention is utillzable as a method for handling the 
congestion of the packet network as a whole. 

30 It is also to be noted that it is possible to realize 

a part or a whole of a packet switch of the present 
invention by a processor (CPU) operated by software. For 
example, each input side transfer unit, each output side 
transfer unit, or both may be realized by one or a 

35 plurality of CPUs. 
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By realizing each input side transfer unit or each 
output side transfer unit of the packet switch by CPU, it 
becomes possible to set up and control the management of 
the information regarding packets such as the transfer 
5 scheduling algorithm, the packet buffer allocation 

algorithm, the packet header format, the congestion status, 
etc., using software. In particular, it is possible to 
realize the control regarding the setting of the priority 
level according to the congestion status (the monitoring of 

10 the congestion status at the output side, the transfer of 
the congestion status to the input side, the management of 
the congestion Information at the input side, the setting 
of the priority level of the packet, etc.) that is the one 
of the major features of the present invention, by 

15 software. 

Using CPU, it becomes possible to change the control 
scheme so that the system can be made flexible. On the 
other hand, using hardware instead of CPU, it becomes 
possible to realize the necessary control at relatively low 

20 cost. A judgement as to whether to use hardware or software 
should be made by in view of such a trade off relationship, 
but the present invention is effectively functional 
regardless of whether a part or a whole of the present 
Invention is realized using hardware or software. 

25 It is also possible to realize the input side transfer 

unit and the corresponding output side transfer unit having 
the same port as that input side transfer unit by an 
identical LSI, and implement the present invention by one 
or a plurality of CPUs incorporated into that LSI. It is 

30 also possible to realize the input/output side transfer 
units corresponding to a plurality or ports by a single 
CPU. 

As should be apparent from the above, the above 
described embodiments according to the present invention 
35 may be conveniently implemented using a conventional 
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general purpose digital computer programmed according to 
the teachings of the present specification, as will be 
apparent to those skilled in the computer art. Appropriate 
software coding can readily be prepared by skilled 
programmers based on the teachings of the present 
disclosure, as will be apparent to those skilled in the 
software art. In particular, the packet switch of each of 
the above described embodiments can be conveniently 
Implemented In a form of a software package. 

Such a software package can be a computer program 
product which employs a storage medium Including stored 
computer code which is used to program a computer to 
perform the disclosed function and process of the present 
invention. The storage medium may include, but is not 
limited to, any type of conventional floppy disks, optical 
disks, CD-ROMs, magneto-optical disks, ROMs, RAMs , EPROMs , 
EEPROMs, magnetic or optical cards, or any other suitable 
media for storing electronic instructions. 

It is also to be noted that, besides those already 
mentioned above, many modifications and variations of the 
above embodiments may be made without departing from the 
novel and advantageous features of the present invention. 
Accordingly, all such modifications and variations are 
intended to be Included within the scope of the appended 
claims. 
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